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ABSTRACT 

We present a study of rest-frame UV-to-optical color distributions for z ^ 4 galaxies based on the 
combination of deep HST/ACS-I-WFC3/IR data with Spitzer/IRAC imaging. In particular, we use 
new, ultra-deep data from the IRAC Ultradeep Field program (lUDFlO). Our sample contains a total 
of ~ 2600 galaxies selected as i?-dropout Lyman Break Galaxies (LBGs) in the HUDF and one of 
its deep parallel fields, the HUDF09-2, as well as GOODS-North and South. This sample is used 
to investigate the UV continuum slopes (3 and Balmer break colors (J125 — [4.5]) as a function of 
rest-frame optical luminosity. The [4.5] filter is chosen to avoid potential contamination by strong 
rest-frame optical emission lines. We find that galaxies at < —21.5 (roughly corresponding to 
^*~4) significantly redder than their lower luminosity counterparts. The UV continuum slopes 
and the J125 — [4.5] colors are well correlated. The most simple explanation for this correlation is that 
the dust reddening at these redshifts is better described by an SMC-like extinction curve, rather than 
the typically assumed Calzetti reddening. After correcting for dust, we find that the galaxy population 
shows mean stellar population ages in the range 10®'^ to 10^ yr, with a dispersion of ~0.5 dex, and 
only weak trends as a function of luminosity. In contrast to some results from the literature, we find 
that only a small fraction of galaxies shows Balmer break colors which are consistent with extremely 
young ages, younger than 100 Myr. Under the assumption of smooth star-formation histories, this 
fraction is only 12-19% for galaxies at < —19.75. Our results arc consistent with a gradual build- 
up of stars and dust in galaxies at z > 4 with only a small fraction of stars being formed in short, 
intense bursts of star-formation. 

Subject headings: galaxies: evolution — galaxies: formation — galaxies: high-redshift — galaxies: 
luminosity function, mass function 



1. INTRODUCTION 

Great progress has been made over the last decade 
in exploring galaxies at very early times. Large optical 
and NIR surveys with HST, such as the HUDF/HUDF09 
(iBeckwith et al.l l2006': 'Oesch et al."2007^ 'Bouwcns et"all 
2011h. GOODS/CANDELS ( Giavahsco ct al. 20041 



Grogin et al. 
(|Trenti et al 





2011 




2011 



Koekemoer et aLllmiD . or BORG 
^dlev et aLn20lir allowed us to 
identify large numbers of galaxies at z ~ 4 — 8 which 
are used to study the galaxy build-up in the first 1.5 Gyr 
of cosmic time. 

One of the main shortcomings of HST surveys, how- 
ever, is that they only probe the rest-frame UV light of 
high-redshift galaxies. Even the recently installed NIR 
camera WFC3/IR only probes to < 2600 A at z > 4 
and the crucial rest-frame optical wavelengths are out of 



^ Based on data obtained with the Hubble Space Telescope op- 
erated by AURA, Inc. for NASA under contract NAS5-26555. 
Based on observations with the Spitzer Space Telescope, which is 
operated by the Jet Propulsion Laboratory, California Institute 
of Technology under NASA contract 1407. 

^UCO/Lick Observatory, University of California, Santa 
Cruz, CA 95064; poesch@ucolick.org 

^ Leiden Observatory, Leiden University, NL-2300 RA Leiden, 
Netherlands 

* University of California, Riverside, CA, USA 

^ Institute of Astronomy, University of Cambridge, Madingley 
Road, Cambridge CBS OHA, UK 

® Department of Astronomy, Yale University, New Haven, CT 
06520 

t Hubble Fellow 



reach of HST. The observed light of high-redshift galaxies 
is thus strongly susceptible to dust extinction. 

The only way to efficiently probe the rest-frame op- 
tical light of faint z > 4 galaxies is with the Spitzer 
Space Telescope, which samples up to ~ 9000 A at 
z > 4 in its 4.5 /im channel. Several surveys have ob- 
tained deep imaging in the [3.6] and [4.5] bands over a 
number of fields to cover the rest-frame optical of the 
large samples of high-redshift galaxies that were iden- 
tified with HST. Such data are crucial for an unbiased 
study of the galaxy stellar mass build-up at z > 4. They 
were already used to measure stellar population proper- 
ties and galaxy masses a t z ^ 4 — 7 (e.g. Eyles et aJ] 



2007: Verma et al 



2009£.Labbe et al 



[20071 IStaFk et al ' 2009; Yabe et all 
2010by: iGonzalez et al.. ,2010. ,2012a 



Gonzalez et all l2012bl: iLee et all I2qi2bf). and recen tlv 
even out to z ^ 8 (jLabbe et al T l2012l : IYan el^Mui) . 

Despite some debate in the early literature, the rest- 
frame UV data alone indicate that UV bright galaxies at 
z > 2, on average, show redder UV spectral slopes than 
their lower luminosity counterparts (e.g. iReddv et al.l 



- rpS- ^ , , , , 

20081: iBouwens et al .l'2009'. ^m^; 'Castell aiio et al.l 120121 : 
Dunlop et all l2012t Finkelstcin ct al. 20l3). It is now 



clear that the galaxies at the bright end of the UV LF 
show significantly redder colors than th e average galaxy 
already as early as z ^ 4 — 6 (e.g. [Lee et al.l 120111 : 
IGonzalez et"ani2012al:[Willott et al.ll2012[ ). Thistrend is 
likely caused by an increase in dust extinction at bright 
luminosities, since small amounts of dust have strong 
effects on the rest-frame UV colors (e.g. .Wilkins et al.l 
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TABLE 1 

Data and Sample used in this Paper 



Field Area [arcmin^] 3a Depth [3.6] So" Depth [4.5] Galaxy Sample (total / > Scr in [4.5]) 

HUDF 4.7 27.1 26.8 230 / 59 

HUDF09-2 4.7 26.8 26.4 118 / 23 

GOODS-S 160 26.2 25.7 1635 / 813 

GOODS-N 120 26.2 25.7 660 / 378 



Note. — Depths are total AB magnitudes. 

IMllBouwens et al .'^wH). 

When including Spitzer/IRAC photometry to measure 
rest-fr ame UV-to-optic al colors as a function of UV lumi- 
nosity. iGonzalez et all ((2012ai) and lLee et all ()2011[ ) find 
that these colors are broadly consistent with being just 
due to dust extinction assuming a similar stellar popu- 
lation age at different UV luminosities. Such a measure- 
ment can be significantly biased, however, if the galaxy 
population shows large scatter in dust extinction at a 
fixed stellar mass. An obvious next step is therefore to 
study galaxy colors as a function of rest-frame optical 
luminosity, which is significantly less affected by dust 
extinction and is a much more reliable proxy for galaxy 
stellar mass. 

The main reason that such studies are not readily done 
is the difficulty of measuring accurate IRAC photometry 
for faint high-redshift sources, given the wide IRAC PSF 
of ~ 1'.'6 10 X the HST PSF). Without a neighbor- 
subtraction scheme, only about 30 — 40% of galaxies are 
isolated enough for reliable aperture photometry (e.g. 
iStark et"al] l2009f ). resulting in small sample sizes and 
low number statistics. Several groups have therefore de- 
veloped sophisticated techniques to obtain photometry 
froi n crowde d IR AC data over the last few years (see 
e.g.lLabbe et al.ll200l FCrazian et al.ll2006l : iLaidler et alJ 

Another complication of rest-frame optical studies 
from broad-band IRAC photometry a,t z > A are strong 
emission lines. There is now ample evidence that rest- 
frame optical lines can provide a very significant contri- 
bution to the ob served Spitzer/IRAC fluxes of up to 0.5 
mag (e.g. iSchae rer & de Barros 2009; Shim ct al. 2011; 
Stark et al.l l2012i : iGonzalez et al., i2012a: .Labbe et al.i 
20121) . Such lines have to be taken into account when 
interpreting galaxy colors or when fitting spectral en- 
ergy distributions (SEDs). For instan ce, usi ng synthetic 
models with emission lines iSchaereTfe de Barro'd ()2009f ) 
and Ide Barros et al.l ()2012f ) estimate that the majority 
of galaxies at z ~ 4 — 7 are best fit with extremely 
young galaxy agefl of < 100 IVIyr. This would have 
very important consequences on the total inferred cos- 
mic star-formation rate density. However, it is a concern 
whether the many high-quality photometric constraints 
in the rest-frame UV compared to the few data points 
in the rest-frame optical could be biasing the ages deter- 
mined from standard SED fits. 

A simple way to test such predictions is to study galaxy 
rest-frame UV-to-optical color distributions. In this pa- 
per, we use deep Spitzer/IRAC imaging for a detailed 
investigation of galaxy color distributions of a large sam- 

* If not mentioned otherwise, throughout this paper, we define 
age as the time since the onset of star-formation. 



pie of z 4 galaxies as a function of rest-frame optical 
luminosities. With these data we study the origin of the 
UV continuum reddening of galaxies, and we constrain 
the fraction of galaxies that are consistent with extremely 
young ages (< 100 Myr) based on dust-corrected UV-to- 
optical colors. 

Throughout this paper, we adopt ^Im — 0.3, J^a = 
0.7, Hq ^ 70 kms~^Mpc ~\ i.e. h = 0.7. Ma gnitudes are 
given m the AB system ijOke fc GunnlllQSSf ). 

2. DATA AND GALAXY SAMPLE 
2.1. Observations 

Here we analyze HST and Spitzer data over four 
different fields: the two wide-area GOODS/CANDELS 
fields, the ultra-deep HUDF, and its deep parallel, the 
HUDF09-^|. All these fields are covered by ACS imag- 
ing from -6 4 ^5 to zsso from the UDF05 (PI: Stiavelli 
Oesch et al.l 120071 [2009), the H UDF jB eckwit h et al.l 
2006D and GOODS surveys (Giavahsco^eLaLj |2004a). 
WFC3/IR data in three filters including J125 and Hiqq, 
was obtained as part of the HUDF09 (PI: lUingworth; 
Bouwens et al. 2011), the ERS (PI: O'Conell; Wind- 
horst et al. 2011), and the CANDELS MOT programs 
(PI: Faber/Ferguson; Grogin et al. 2012, Koekemoer et 
al. 2012). For GOODS-North, we only include the first 
six visits obtained until July 19, 2012. The data reach to 
limits of HiQo = 27.0 — 28.5 mag over the ~ 300 arcmin^ 
GOODS fields, and Hieo = 29.3 - 29.9 mag over the two 
HUDF09 fields (covering a total of ~ 10 arcmin^). 

Most import antly, all fields are covered by deep 
Spitzer/IRAC (jFazio et all f2007) imaging in the [3.6] 
and [4.5] channels. In particular, we use new ultra- 
deep data from the 262 h IRAC Ultradeep Field program 
(lUDFlO; PI: Labbe; see also Labbe et al. 2012). This 
survey pushes the IRAC [3.6] and [4.5] imaging to expo- 
sure times of - 120 h and ~ 80 - 120 h over the HUDF 
and the HUDF09-2, respectively. The two wider-area 
GOODS fields were previously covered by 23 h of public 
IRAC imaging (M. Dickinson et al. in prep). 

All the data and samples used in this paper are sum- 
marized in Table [1] 

2.2. Measurements of the z 4 Galaxy Sample 

The galaxy sample analyzed here is identified using 
a standard z 4 Lyman Break Galaxy (LBG) -B435- 
dropout selectiori based on the HST ACS dat a (see, e.g., 
iGiavalisco et al.ll2004bt iBouwens et al.ll2007[ ). Sources 
were identified in a image based on the Vgoej *775, 

^ The second HUDF09 parallel field to the North-East is not 
used due to its lack of -B435 imaging, which is needed for the 2 ~ 4 
LBG selection. 
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and ^850 ACS data, before applying the following color 
selection criteria: 

(^435 - ^4>06 > 1-1) A B435 - ^606 > (^^606 " -^SSo) + 1-1 
A(V'606 - 2^850 < 1-6) 

This selects galaxies at z ~ 3.5 — 4.5, with a mean red- 
shift z = 3.8 and results in a total sample of 2643 galax- 
ies down to Mjjv = — 16 mag over all four fields. HST 
photometry is obtained in small elliptical Kron apertu res 
from PSF matched images (seeE ouwens "eFalllMl for 
details). The IRAC photometry is derived following the 
procedure of iLabbe et al.l ()2010al lbl). In particular, we 
subtract neighboring foreground sources based on their 
profiles in the HST J125 and Hiqq images, after convolu- 
tion to the IRAC PSF. We then perform aperture pho- 
tometry on the cleaned images in 2" diameter apertures, 
and correct to total fluxes using the growth curves of 
nearby stars in the field. The typical corrections are a 
factor 2.2, consistent with expectations from the IRAC 
photometry handbook. A comparison of our photome- 
try against the GOODSMUSIC catalog over GOODS- 
South (jSantini et al.ll2009D shows no bias and a scatter 
of 0.20 - 0.25 mag. 

The automated IRAC cleaning procedure does not al- 
ways work. In particular, if a source is too close to an ex- 
tremely IRAC-bright neighbor, reliable photometry can 
not be extracted. We therefore inspect all sources by eye 
for failures in the neighbor subtraction and flag those. 
These are excluded from the final analysis. With our 
procedure we are able to obtain clean IRAC photom- 
etry for ~ 75% of all sources. This is compared to a 
~ 30 — 40% success rate for simple aperture photometry 
on isolated sources only. We therefore more than double 
the galaxy sample compared to previous analyses which 
omit such a neighbor subtraction. Details on the IRAC 
cleaning and photometry will be presented in Labbe et 
al. (in preparation). 

Since the IRAC flux in [3.6] can be significantly af- 
fected by strong Ha emiss ion for a large fraction of our 
z ~ 3 .5 - 4.5 sample (e.g. iShim et al.l 1201 It iStark et al.l 
l2012f ). we restrict our analysis to IRAC [4.5] band detec- 
tions. This samples rest-frame 9000 A, which is devoid 
of strong line emission. Our galaxy sample consists of 
1273 sources with IRAC [4.5] detections more significant 
than 3(7. 82 of these come from the ultra-deep lUDFlO 
data. 

Using the observed [4.5] fluxes, we compute the rest- 
frame absolute magnitude at 9000 A, assuming a fixed 
redshift for all sources at the mean of our sample z — 3.8. 
In the following, we will denote this as M^. 

One of the main goals of our analysis is to iden- 
tify the origin of the UV continuum reddening in these 
galaxies. To this end, we compute the U V continuum 
slopes, /3 {f\ (X X^), as in iBouwens et al.l ((2012) from a 
power-law fit to the observed fluxes in the HST bands 



606, *775, ^850, ^098/105 



(where available), and J125. 



3. RESULTS 

3.1. Galaxy Colors as a Function of Rest- Frame 
Optical Luminosity 

Our large sample of galaxies with robust IRAC pho- 
tometry allows us to investigate galaxy colors as a func- 
tion of rest-frame optical luminosity. Of particular inter- 
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Fig. 1. — The observed Balmcr break color J125 — [4.5] (sampling 
rest-frame 2500 A— 9000 A) vs. rest-frame optical magnitude for 
2 ~ 4 galaxies. The gray dots represent galaxies with > Scr de- 
tections in [4.5] and > 5cr in J125 (dark gray: lUDFlO, light gray: 
GOODS). The median colors in bins of Mz are indicated by red 
dots with errorbars computed from bootstrap resampling. A clear 
luminosity dependence is evident, with a significant steepening at 
the bright end, where the Balmer break colors become increasingly 
redder. The increase in reddening sets in at around = —21.5, 
which roughly corresponds to L*{z = 4), as estimated based on the 
characteristic magnitude of the UV LF and on the average UV-to- 
optical colors. This is indicated by the vertical black arrow. The 
dashed red line represents two linear fits to the J125 — [4.5] vs Mz 
relation at A/z < —21.5 and Mz > —21.5. Shown as gray and 
black dashed lines are the expected 50% completeness limits of the 
GOODS and lUDFlO datasets, respectively. The horizontal blue 
arrow shows the approximate location of galaxies with UV contin- 
uum slopes /3 = 1 according to the relation we find later in Figure 
[S] The LBG selection volume is significantly reduced for sources 
redder than that. 



est is the J125 — [4.5] color, which straddles the Balmer 
break. At z 4, J125 samples 2500 A, and [4.5] sam- 
ples 9000 A. Note that J125 is chosen over Hieo because 
the latter is contaminated by light longward of 4000 A 
for the lower redshift tail of the B-dropout distribution, 
which complicates its interpretation. 

The J125 — [4.5] color-magnitude diagram is shown in 
Figure [TJ This plot reveals that more luminous galax- 
ies are significantly redder than their lower luminosity 
counterparts. Galaxies at = —22.5 show colors of 
J125 — [4.5] = 1.4, while galaxies at ~ —20 have 
J125 ~ [4.5] = 0.6. In particular, at > -21.5 the 
color distribution is well-described by a Gaussian with 
an observed dispersion of 0.4 — 0.5 mag (see also later 
Figure [5]) . 

Interestingly, a luminosity dependence in the color- 
magnitude diagram is seen at all luminosities. However, 
the dependence is significantly steeper at the brightest 
luminosities. The color-magnitude relation is well repro- 
duced by a two component linear relation, with a break 
roughly at (z = 4|3- The slope of the relation changes 

^'^ L* is crudely estimated based on the average 4775 — [4.5] 
color and the chara cteristic lumin osity of the UV LF at 2 = 4 
of M^y = —21.06 IIBouwens et al.ll2007i ). This results in an es- 
ed 



timated M* = —21.7 mag, which is significantly fainter than the 
characteristic luminosity of the rest-frame V-band LF as measured 
bv lMarchesini et al.1 l|2012l ) who find My 



-22.761E;:« mag. 
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Fig. 2. — Median stacked SEDs of galaxies in bins of 2-band lu- 
minosities. The numbers indicate how many galaxies contribute 
to each stack. Dramatic trends toward redder colors are seen 
as a function of rest-frame optical luminosities, in particular at 
< —21.5. Galaxies are clearly not self-similar as is inferred 
from studies based on binning galaxies in UV luminosities only 
(e.g. Gonzalez et al. 2012a). The open symbols indicate filter fluxes 
which are not used when deriving synthetic SED fits to the stacks, 
since they are expected to change significantly across the extent 
of the LBG redshift selection. The //igo-band probes longward of 
4000 A for the lower redshift tail of the selection, while strong Ha 
emission may contaminate the [3.6] measurement for 50% of the 
sample (at z = 3.8 — 5.0). As can be seen, both these filter fluxes 
are indeed significantly higher than the best-fit SED for all stacks. 

from -0.64 ± 0.02 at L > L* to -0.15 ± 0.05 to L < i*. 

Previous studies that were based on binning galaxies as 
a function of rest-frame UV luminosities suggested that 
the average UV-to-optical SED of galaxies chang es only- 
very lit tle with luminosity. For instance, Gonzalez et al.l 
(|2012aD find rest-frame U — V colors that get redder by 
only 0.026 per mag in UV luminosity. Such weak trends 
were in good agreement with the suggestion from SPH 
simulations that found star-formatio n histories of ga lax- 
ies to be essentially self-similar (Finl ator et al.|[2'01lD . 

Our findings above suggest that using rest-frame op- 
tical luminosities to bin and stack galaxy SEDs, much 
more significant trends should become apparent. This is 
indeed the case as shown in Figure [21 where we present 
SEDs in bins of Mz- A significant increase in reddening 
toward brighter z-band luminosities is obvious in these 
stacks. While these SEDs span 3 mag in M^, they only 
vary by ~ 1.5 mag in UV luminosity. Furthermore, the 
stacked SEDs show a clear correlation between the UV 
continuum slope and the UV-to-optical color. This indi- 
cates that galaxies that are luminous in the rest-frame 
optical are significantly more dust obscured than fainter 
systems, which is likely the origin for the steeper trend 
in the J125 — [4.5] color we found in Figure [TJ 

Finally, from these stacks it is clear that Ha fiux con- 
tamination is indeed a potential problem for our B- 
dropout sample. All stacked SEDs show a significant 
excess at 3.6 /im compared to the best-fit SED, ranging 
from 0.06 mag to 0.14 mag, with a tentative trend to 
a larger excess at fainter luminosities. Since Ha is only 
present in the [3.6] filter sX z = 3.8 — 5.0 (i.e. in ^ 50% 
of our redshift selection), these numbers are roughly con- 
sistent with the m ean excess of 0.27 mag as inferred by 
iStark et "all (|2012() . in particular for fainter sources. 



A potential problem with the above measurements are 
the combination of different selection effects that can re- 
sult in biased distributions. In Figure [U we show the es- 
timated completeness limits of our data as dashed lines. 
The vertical limit is due to the IRAC S/N cut, while 
the slanted line is mostly caused by the Ji25-band cut, 
but its slope is influenced by the correlation between 
J125 — [4.5] vs /3 (see Figure [3]) due to the LBG selection. 
The completeness lines are estimated based on simulat- 
ing a large number (10^) of galaxies with synthetic SEDs 
with Mz = —24 to —18, with UV slopes in the range 
/3 = — 3 to 1, and with redsh ifts according to our r edshift 
selection function (see e.g. iBouwens et al.l [20071) . The 
fluxes of these SEDs are then perturbed with Gaussian 
dispersions and all the selection criteria are applied, from 
which we compute the completeness as a function of AIz 
and J125 - [4.5]. 

Clearly, at faint luminosities, we are potentially biased 
against red sources in the GOODS data. However, the 
Ji2b — [4-5] color distribution peaks at sufliciently bluer 
colors than the completeness limit. Using the simulations 
above, we estimate that the observed mean color at Mz = 
—20 is not signiflcantly biased due to selection effects 
(< 0.05 mag). 

Finally, photometric errors in [4.5] might bias the mea- 
surement of the color-magnitude relation of Figure [1] 
since both axes depend on the [4.5] fiux measurement. 
A positive error in [4.5] would result in a brighter Mz 
with a redder J125 — [4.5] color. We therefore checked 
and confirmed that an analogous steepening in the color- 
magnitude relation is still seen if the absolute magnitude 
derived from the [3.6] photometry is used instead of Mz- 
This relation is not shown here due to the possible flux 
contamination from strong Ha lines, as explained earlier. 
Nevertheless, this test suggests that the steepening in the 
color-magnitude relation is not significantly affected by 
scatter in the [4.5] flux measurements. 

3.2. UV Continuum Slopes and Balmer Breaks 

In order to investigate the origin of red UV contin- 
uum slopes P in brighter galaxies, we study the relation 
between /3 and the J125 — [4.5] color of z 4 galaxies, 
which effectively compares the reddening in the UV with 
the Balmer break. 

As shown in Figure [21 there is a clear and relatively 
tight relation between the UV slope /3 and the J125 — 
[4.5] color, with UV redder galaxies showing redder colors 
across their Balmer break. The relation is well-fit by the 
linear relation: 

J125 - [4.5] = (0.64 ± 0.03) X /3 + (1.72 ± 0.04) (1) 

where we have included all galaxies with J125 detections 
> 5cr, irrespective of their [4.5] detection significance. 
Excluding sources with non-detections in the IRAC [4.5] 
band results in a relation with an identical slope, but 
with a slightly redder zeropoint (1.86±0.04). 

For sources with /3 < —1.5, the color dispersion per- 
pendicular to the dust reddening is only 0.4 mag, which 
increases to 0.5 — 0.6 mag for redder sources. These dis- 
persions are most likely explained by a variation in stellar 
population ages as indicated by the age vector in Figure 
[5] (see also Sectio n 13.41). 

Interestingly, a lCalzetti et al.l ()2000D dust curve would 
lead to a steeper relation than what is seen in the data. 
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Fig. 3. — The relation between the UV continuum slope /3 and 
the observed J — [4.5] color for 2 ~ 4 galaxies. As in Figure [T] 
the filled dots correspond to sources with > 5cr detections in J125 
and S/N([4.5])> 3. Dark and light grey dots denote galaxies in the 
lUDFlO and GOODS-North/South respectively. Open circles cor- 
respond to lower significance IRAC fiuxes (< 3cr) in either sample. 
The mean stacked color (including the measured fluxes of IRAC 
undetected galaxies) in bins of /3 are indicated as red dots with 
errorbars corresponding to the uncertainty on the mean. The red 
dashed line indicates a linear fit to these stacked colors. The cor- 
relation between /9 and the J — [4.5] color is primarily driven by 
dust extinction, as shown by the vectors in the lower left. CA in- 
dicates the Galzetti et al. 2000 star-burst dust reddening vector 
with Ay = 0.5 mag, and SMG represents the Pel 1992 reddening 
vector with Ay = 0.25 mag. Also shown is an age vector which is 
derived for an SED with constant star-formation and an age from 
0.1 to 1 Gyr. 
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This can be seen from the dust reddening vector shown in 
the lower left of Figure [3l Galzetti dust results in a slope 
of 1.29 and is clearly inconsistent with the relation given 
above. This means that the correlation between /3 and 
J125 — [4.5] can not simply be explained by Galzetti dust 
reddening only, but would require that dustier galaxies 
show younger stellar population ages in a correlated man- 
ner which results exactly in the observed slope. 

A simpler explanation of the J125 — [4.5] vs /3 correla- 
tion, however, is that galaxies are instead r eddened by 
SMG-like dust. Using the parametrization of iPeil ()1992f ) 
SMG reddening results in a slope of 0.66, which is in 
excellent agreement with the observed relation (see also 
the reddening vector in Figure [3|). Our data therefore 
provides tentative evidence that SMG reddening might 
be more appropriate for galaxies at z ^ 4. At least 
for brighter galaxies, this is in agreement with recent 
measurements of the infrared-ex cess for z ^ 4 sources 
with Herschel (|Lee et al.l l2012al ) . At fainter luminosi- 
ties, the Herschel re sults are consistent with th e standard 
iMeurer et all (|1999D and lGalzetti et al.l (|2000D extinction 
instead. 

We therefore test whether we find a luminosity depen- 
dence in the J125 — [4.5] vs (3 relation in our data. In 
Figure |4j we show this relation for three bins of z-band 
luminosity, down to Mz < —19.75, which is the com- 
pleteness limit of the GOODS IRAG data. 

As is evident, the brightest galaxies with < —21.75 
show a very large range of UV continuum slopes from 
/3 ~ — 2 to ^ ~ 0. Across this full range, their J125 — [4.5] 



Fig. 4. — The relation between the UV continuum slope f3 and 
the observed J — [4.5] color for 2 ~ 4 galaxies as a function of 9000 
A luminosity. The dashed red line is the mean relation derived for 
all galaxies (Equation [TJ and is repeated in every panel. As in Fig- 
ure [3] the filled dots correspond to sources with > 5(t detections in 
J125 and S/N([4.5])> 3. Dark and light grey dots denote galaxies 
in the lUDFlO and GOODS-North/South respectively. Open cir- 
cles correspond to lower significance IRAC fluxes (< Scr) in either 
sample. The mean stacked color (including the measured fluxes of 
IRAC undetected galaxies) in bins of fB are indicated as red dots 
with errorbars corresponding to the uncertainties on the means. 
Only bins with more than 15 sources to stack are shown. 



colors follow the observed slope for the full sample ex- 
tremely well, consistent with the Herschel results and 
with a reddening of SMG-like dust. However, the col- 
ors of these bright sources are systematically shifted by 
about 0.3 mag to the red, likely due to systematically 
older stellar population ages compared to the average 
galaxy in our sample. The fact that there are almost 
no bright galaxies with (3 < —2 indicates that the most 
luminous (and thus the most massive) galaxies at z ~ 4 
must have formed both their stars and their dust over an 
extended period of time. 

As we move to lower luminosities, we find that the 
mean UV slope clearly shifts to bluer, more dust-free 
colors. Interestingly, in the faintest bin (M^ ~ —20.25) 
the J125 — [4.5] color appears to be independent of (3. 
However, this is potentially due to incompleteness, which 
starts affecting galaxies as blue as J125 — [4.5] = 0.9 in 
that bin (see Figure [1]). 
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3.3. Dust- Corrected UV-to-Optical Colors 

The goal of our study is to constrain the intrinsic dis- 
tribution of Balmcr break colors in the z 4 LBG popu- 
lation. Having found the correlation in Figure |31 we have 
a means of empirically correcting the J125 — [4.5] colors 
for dust reddening. Our fiducial dust correction proce- 
dure is as follows: Starting from the observed J125 — [4.5] 
and /? values for galaxies, we find the intersection of the 
reddening vector with the track of a synthetic SED model 
with constant star-formation and with ages in the range 
10'' — 2 X 10^ yr. For the reddening vector, we use the 
empirical slope of Equation [l] which is essentially equiv- 
alent to SMC reddening. 

This procedure is chosen to more accurately represent 
the likely dust extinction in young galaxies, which typ- 
ically have bluer UV continuum slo pes t han th e value 
of -2.23 for which the lMeurer et all fM99: ll999D correc- 
tion results in zero dust extinction. The intrinsic, dust- 
corrected /3 values of our fiducial approach lie in the range 
—2.8 to —2.3. In the appendix we show a schematic fig- 
ure which explains our dust correction (Figure I10[) , and 
we show the effect of using the standard M99 dust cor- 
rection for comparison. 

Histograms of both corrected and observed colors are 
shown in Figure [5l again for three bins of z-band luminos- 
ity down to Afz = —19.75. Our fiducial dust-correction 
results in average offsets of 0.8 mag at < —21.75, 
and only 0.25 mag at — —20.25. These offsets in 
</i25 — [4.5] colo r corre spond to Ay = 0.41 mag and 0.13 
mag for the lPeil ()1992f ) SMC extinction law, respectively. 

While the original colors were strongly luminosity de- 
pendent, the corrected J125 — [4.5] colors show only 
a weak trend with luminosity. The brightest sources 
(Mz < —21.75) are only 0.2 mag redder on average 
than galaxies at > —21.75. Note that these his- 
tograms are well-sampled. From bright to faint, the three 
luminosity bins in Figure [5] contain 234, 438, and 545 
galaxies, respectively. 

For the two lower luminosity bins, we also show the 
color histograms for blue sources with /3 < —2, for which 
the dust corrections are smallest. This provides a good 
consistency check. Overall, sources with /3 < —2 com- 
prise 30% of the sample at < —19.75 (326 galaxies). 
However, at < —21.75 this f3 < —2 sample contains 
only 16 sources, preventing a reliable comparison (which 
is why these are omitted from Figure ^ . In both fainter 
bins, the /3 < — 2 sample shows de-reddened color dis- 
tributions in excellent agreement with the full sample, 
indicating that our dust-correction procedure is reliable 
and unbiased. 

3.4. Limits on the Predominance of Extremely Young 
Calaxy Ages 

Several papers have recently claimed that a dominant 
fraction of z > 4 galaxies may exh ibit very young stellar 
popu l ation ages (< 100 My r; e.g. iSchaerer k, de BarrosI 
I2010t Ide Barros et al.l 120121 ). This would have very im- 
portant consequences, since it would imply extremely 
short duty cycles for LBGs, and large c orrections to 
the c osmic star-formation rate density (e.g. iJaacks et al.l 
[Ml). One major difficulty in inferring stellar popula- 
tion ages from SED fitting is the degeneracy between 
dust extinction, age, and metallicity in reproducing the 
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Fig. 5. — Histograms of dust-corrected colors (J125 — [4.5])acorr 
in three bins of rest-frame optical absolute magnitude at 9000 A 
(thick red line). The underlying, shaded blue histograms represent 
only the blue galaxy population with UV-continuum slopes /3 < —2 
for which the dust correction is small. Also shown as thin, orange 
lines are the un-corrected color histograms for the full population, 
indicating that the dust-correction is much more significant for 
brighter galaxies. The dots with horizontal error bars represent the 
median and ~ Icr range of the distribution. These indicate that 
unreddened galaxy colors show weak trends with rest-frame optical 
luminosity, but that the width of the distribution does not change 
dramatically. Vertical dashed lines indicate the colors of SEDs 
with constant star-formation rates and stellar population ages of 
0.1 and 1 Gyr, which enclose the majority of the observed colors. 
The fraction of galaxies that are consistent with very young ages 
(< 0.1 Gyr) is indicated on the left in each panel. This amounts 
to only 12% when corrected for photometric scatter and averaged 
over all galaxies with < —19.75 mag. Note, however, that the 
exact number depends on the assumed star-formation history. 

reddening of the UV slope. These problems are exacer- 
bated at z > 4, since HST only probes rest-frame UV 
wavelengths, and the only rest-frame optical information 
typically comes from only two Spitzer bands. 

Given these complications, it is thus instructive to use 
assumptions as simple and model independent as possi- 
ble to test what range of ages is consistent with the data. 
This can be done using our simple dust-corrected Balmer 
break colors (J125 — [4.5])^corr- In Figure[51 we indicate 
the colors of SEDs with constant star -formation and ages 
10^ yr and 10^ yr. These are based on lBruzual fc Chariot] 
(2003) models, with Salpeter IMF and sub-solar metal- 
licities {Z = 0.2Zq). As can be seen, these lines nicely 
bracket the peak of the color distribution, indicating that 
the average z ^ 4 LBG shows stellar population ages in 
this range. 

The fraction of sources with colors consistent with very 
young ages below 10* yr (i.e. with (J125 — [4.5])^corr < 
-0.19) is very small; less than 10% at < -20.75 and 
only 21% at AI^ — —20.25. Integrated over all galax- 
ies with Mz < —19.75 for which we are complete, the 
number of galaxies with (J125 — [4.5])^corr < —0.19 is 
14%. A simple correction of this number for photomet- 
ric scatter suggests that this fraction intrinsically is even 
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Fig. 6. — The color distribution for all galaxies with Mz < 
— 19.75, together with three different model distributions. The 
filled histograms show the dust-corrected J125 — [4.5] colors from 
the previous figure. The two thin lines show the best-fit color dis- 
tributions when assuming a Gaussian distribution in log(age) and 
two different SFIfs (CSF: constant star-formation, RSF: a power 
law rising SFR f^ '^). Both reproduce the color distributions fairly 
well. However, they fall somewhat short at th e bluest and red - 
dest tails of the distribution. The colors of the IGuo et aLl II2011I ) 
SAM are shown as thick purple line. These result in a color distri- 
bution in excellent agreement with the observed corrected colors, 
indicating that the SAMs predict SFHs that represent reality very 
well. 

smaller, i.e. 12%. The formal uncertainty on this num- 
ber is only 1%. However, we note that this fraction is 
highly model dependent. In particular, using an SED 
model with rising star-formation rates, the young frac- 
tion is significantly reduced, due to bluer galaxy colors at 
a given age. Additionally, in the appendix we estimate 
that using the standard M99 and iCalzetti et"all (|2OOO0 
dust relations and SEDs with constant star-formation 
would result in a young fraction of 19%. Despite of these 
uncertainties, the observed color distributions indicate 
that the galaxy population may only contain a relatively 
small number of extremely young sources, in contrast to 
earlier findings in the literature based on full SED fitting. 

3.5. Fits to the Color Distribution at z ^ A 

Using our corrected color distribution, we can set in- 
teresting constraints on the ages and star-formation his- 
tories (SFHs) of z ~ 4 galaxies. In particular, we can fit 
for the distribution of galaxy ages under the assumption 
of smooth SFHs. This is done as follows: We sample 
galaxy ages from a normal distribution in log(age) with 
mean ha and width a a- We then compute the expected 
galaxy J125 — [4.5] color for the given age, assuming either 
a constant star-formation rate (CSF) or a power-law ris- 
ing star-formation history (t^-^, RSF). The latter is the 
population a yerage SFH consistent with the evolution of 
the UV LF (jPapovich et al.ll201lD . The photometry is 
then perturbed using the real flux uncertainties of our 
galaxy sample, and the resulting histogram is compared 
to the data using a Poissonian likelihood of the number 
counts. 

The best-fit color distributions are shown in Figure 
[HI These correspond to Gaussian age distributions with 
\og(age) = 8.57 and ciogage = 0.50 for constant star- 
formation and \og(age) = 9.00 and uiogage = 0.56 for 
the rising star-formation histories. The somewhat older 



^-0.5 



1 1 1 1 

without nebular continuum 


1 


with nebular continuum . 




84% y 




50% y'yy 




16% ^-^'C^'^^ / 






75% 




- '/■/ 




^4/ RSF CSF 2 component SED 

' 1 1 1 1 1 



'6.5 7 7.5 8 8.5 9 9.5 

log age (mass weighted) [yr] 

Fig. 7. — The J125 - [4.5] color evolution of different SFHs as 
a function of mass-weighted age. The star-formation histories are: 
a power-law en t^-'^ (RSF), constant star- formation (CSF), and a 
two-component SFH with a varying mass-fraction of an extremely 
young stellar population (of age 20 Myr) on top of an older popu- 
lation (of age 1.6 Gyr). The location of 25%, 50%, and 75% young 
mass fractions are indicated by small dots. Solid lines are for pure 
stellar continuum, while dashed lines include the UV reddening due 
to nebular continuum emission from hydrogen and helium recombi- 
nation in the inter-stellar medium of a galaxy (assuming ionization 
equilibrium and an escape fraction of 50% for ionizing photons). 
The shaded gray area spans 68% (~ Icr) of the observed galaxy 
color distribution at < —19.75 mag (see Figure [Sjl. The ma- 
jority of galaxies clearly has less than a 25% contribution from 
extremely recent bursts, and sources are consistent with a mass 
weighted stellar population age between 50 and 500 Myr. 

mean stellar population age for the latter SFH is a result 
of bluer galaxy colors at a fixed stellar population age, 
given the dominance of younger stars. 

Again, for both these distributions, we find a very small 
number of galaxies with ages younger than 100 JVIyr (only 
13% for CSF and 4% for RSF). Interestingly, however, 
these distributions predict that a significant fraction of 
galaxies (50% for CSF and 24% for RSF) would not have 
been forming stars for longer than 400 IVIyr, which is the 
time between z ~ 5 and z A. These sources would 
therefore not have been present in the higher redshift 
LEG sample. The mean formation redshift of our sample 
is inferred to be z/ = 5.0 or Zj = 9.3 assuming CSF or 
RSF, respectively. 

While the Gaussian age distributions for both assump- 
tions on the SFHs do reproduce the observed color his- 
tograms rather well, they fall somewhat short at the 
bluest and reddest tails of the distribution. This indi- 
cates that real star-formation histories contain a modest 
component of bursty star-formation. 

We can obtain a first constraint on the contribution 
of extremely short bursts to the total star-formation of 
galaxies from the expected J125 — [4.5] color for a two- 
component SED with a very young population added to 
an older underlying one. This is shown in Figure [T) The 
Balmer break color saturates very quickly close to the 
color of the young component, even for a small mass 
fraction (< 10%) produced in a recent burst. Bursty 
star-formation would therefore significantly tilt the color 
distribution to young galaxy ages, even if only a small 
mass fraction is built in bursts. The exact numbers de- 
pend on the detailed assumptions on the underlying old 
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population and burst ages, but this suggests that the 
majority of galaxies is consistent with only a very small 
fraction of stars being formed in very recent bursts. 

A further constraint on bursty star-formation can be 
obtained from the equivalent width distribution of strong 
rest-frame optical lines. Based on galaxies w ith spectro- 
scopic redshifts in the range z = 3.8 — 5.0, iStark et all 
(|2012D estimated the combined Ha-h[SII]-F[NII] equiva- 
lent widths from the IRAC [3.6] flux excess. They find 
that this distribution peaks at log(VF/A) = 2.57 and 
has a dispersion of 0.25 dex. We can estimate the pre- 
dicted equivalent widths from our age distribution us- 
ing a simple model for hydrogen recombination. As- 
suming a 10% escape fraction of ionizing photons (con- 
sistent with the m ost recent estimates for LBGs from 
INestor et al.ll20l"^ and photoionization equilibrium, we 
ca n derive the Ha emission line strengths from the tables 
in lOster brock fc Ferlandl ()2006D and from the total num- 
ber of ionizing photons that are produced in the SED 
model. This calculation shows that our age distributions 
result in equiv alent widths that a re lower than the ob- 
served values of lStark et al.l (J2012) by only ^0.15 dex, in- 
dicating that such smooth SFHs can produce most, but 
not all of the rest-frame optical line strengths. Bursty 
star-formation is thus certainly present to produce the 
most extreme tail of Ha emission, but it is likely not 
dominant. 

Furthermore, we can use these calculations to check 
what fra ction of the equiva lent width distribution mea- 
sured bv IStark et al.l (|2012[ ) requires young galaxy ages. 
For an SED with an age of 100 Myr, we derive a com- 
bined Ha-H[SII]4-[NII] equivalent width of 480 A and 
790 A for our CSF an d RSF SFH models, respectively. 
The IStark eFall (|2009[) distribution contains ~ 30% and 
~ 10% of galaxies with larger values. This provides fur- 
ther evidence for relatively small fractions of very young 
galaxies in LBG samples. 

In order to compare our observ ations with more realis- 
tic SFHs, we use the resuhs of the lGuo et al.l (|20lH ) semi- 
analytical model (SAM) based on the Millennium-II dark 
matter simulation. In order to extract the J125 — [4.5] 
colors of the SAM, we fit SEDs to the dust-free abso- 
lute magnitudes of their simul ated galaxies at z = 3.8 
with iBruzual fc Chariot! ()2003l ) templates, and extract 
the J125 and [4.5] magnitudes of the best-fit SEDs. These 
intrinsic colors are then perturbed by the flux errors from 
our observational data. The result of this is shown as 
thick purple line in Figure [6l Obviously, the predicted 
color histogram is in very good agreement with the ob- 
served one, indicating that this SAM predicts SFHs that 
are a ve ry good match to real galaxies. Note, however, 
that the iGuo et al.l ()201lD model does not do as well in 
reproducing the dust-reddened (i.e. directly observed) 
color distributions. Furthermore, their model falls short 
of bright galaxies i n the LFs at blue wavele ngths at z > 2. 
As argued in, e.g.. iHenriques et al.l ([2012) both these ef- 
fects are likely connected to uncertainties in the dust 
modeling. Analyses similar to the ones in our paper will 
help to further test and refine such models. 

4. DISCUSSION 

The large sample of galaxies with reliable IRAC pho- 
tometry analyzed in this paper allowed us for the first 
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Fig. 8. — (Top) The parameters of the best-fit age distribu- 
tions as a function of rest-frame optical luminosities. Mean galaxy 
ages are increasing only by 35% (60%) from Mz = —20.25 to 
Afz = —22.25, while the width of the age distribution decreases 
slightly from a ~ 0.55 at faint luminosities to tr ~ 0.3 — 0.4, de- 
pending on the assumed SFH. (Bottom) The fraction of galaxies 
with ages younger than 100 Myr based on the inferred age distri- 
butions. Clearly, the brightest galaxies are not dominated by short 
bursts with < 5% showing galaxy ages younger than 100 Myr. For 
constant star-formation histories, this increases to 15% at lower 
luminosity, while it stays rather small for rising SFHs. 



time to study the correlations between UV and optical 
colors at z ~ 4, as a function of optical luminosity. The 
relatively tight correlation we find between the J125 — [4.5] 
color and the UV slope /3 in Figure [3] provides direct ev- 
idence that the main driver for UV reddening in LBG 
samples is dust extinction. 

The shallow slope of this r elation is particularly 
intriguing, since Calze tti et al.l ()2000l ) dust reddening 
would result in significantly steeper trends. This could 
in principle be the result of correlated changes in galaxy 
ages with dust extinction, such that younger galaxies are 
more heavily obscured. However, a more simple expla- 
nation is that dust extinction at 2 4 is more accurately 
described by an SMC-like dust curve. 

Certainly, our measurements are not ideal to determine 
the exact shape of the reddening law. However, the fact 
that the brightest galaxies do follow a J125 — [4.5] vs /3 
trend that is parallel to the mean trend but offset by a 
constant value strongly suggests that the correlation is 
driven by SMC dust, and that the offset is due to older 
stellar population ages compared to the mean. If this is 
confirmed in future analyses, the use of SMC-like dust 
would result in small corrections of the reported SFRs 
for 2; > 4 galaxies in the literature. 

While both local Lyman Break Analogs and the ma- 
jority of bright LBGs at z ~ 2 — 3 are more consistent 
with Calzetti dust extinction (e.g. lOverzier et al.ll2011l : 
iReddv et al1l2012l l2blO). SMC-like extinction has been 
found to better reproduce the infrared exce ss of young 
low-l u minos i ty galaxies a t z 2 — 3 (e.g. iSiana et all 
l200l. I2OOI IReddv et al.l 120121 ). On the other hand, 
direct measurements of the IR excess with Herschel 
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of z '~ 4 galaxies suggests an inverted luminosity de- 
pendence, with brighter galaxies having SMC-like dust, 
while fainter gala xies are found to follow the M99 relation 
(|Lee et alJl2012al ). 

Clearly, the appropriate dust curve is still an open 
question. Solving this problem will require a multi- 
wavelength approach using complete samples of galax- 
ies with unbiased spectroscopic redshifts, and combining 
deep HST and Spitzer data with Herschel and ALMA 
observations. Further progress should thus be possible 
in the very near future. 

For now, we use the J125 — [4.5] vs /3 relation to empir- 
ically correct the Balmer break color for dust extinction. 
While the observed J125 — [4.5] colors are significantly 
redder for the brightest sources, the dust-corrected col- 
ors only show a relatively weak dependence on rest-frame 
optical luminosity. A residual 0.2 mag difference between 
galaxies at Mz < —21.75 and fainter sources is easily ex- 
plained by a small increase in the stellar population age 
by only ^0.2 dex. 

Dust extinction is clearly the main contributor to red 
galaxy colors even at the brightest luminosities, and the 
steepening of the color-magnitude relation seen in Fig- 
ure [1] is the result of a significant increase in the dust 
reddening in the most massive galaxies. This could have 
important consequences for the interpretation of the UV 
LFs, since increased dust extinction can effectively sat- 
urate the UV lumin osities (see e.g. iReddv et al.l 120081 : 
iBouwens et"ani2009f ). 

The steeper color-magnitude relation at L > L* 
is an indication that the most massive galaxies pro- 
duce a significantly higher dust mass relative to their 
stellar mass than sub-L* galaxies, or that they can 
more easily hold on to the dust produced in their su- 
pernovae compared to lower mass system s (see also 
Bouwens et al. 2012; F inkelstein et al.ll2012| ). Ahhough 
Finkelstein et al.. (2012i) only rely on rest-frame UV data 
to infer stellar masses, they find that the most mas- 
sive galaxies show surprisingly red UV continuum slopes, 
which would require significant amounts of dust to be 
present as early as z ^ 7. A more robust exploration 
of this effect is now possible based on the large samples 
with IRAC photometry (Gonzalez et al., in preparation). 

In contrast to the strong trends of dust extinction, the 
galaxy ages only vary weakly with luminosity. The in- 
ferred mean galaxy ages increase by only 35% to 60% 
across 2 mag in z-band luminosity, depending on the as- 
sumed SFH (see Figure [8|) . This is derived using the 
same simulations as described in Section 13.51 The best- 
fit distributions show that there are essentially no bright 
galaxies with colors consistent with being dominated by 
very young bursts. However, this fraction appears to be 
increasing toward fainter luminosities (lower panel Fig- 
ure [8]), which can further be explored in the near future 
with the deeper S-CANDELS data. 

Unfortunately, current simulations of high-redshift 
galaxies mostly focus on the sub-L* population, due to 
the very large volumes necessary to sample the bright- 
est galaxies. However, the very steep increase in dust 
extinction in > L* galaxies which we find in our data 
presents an interesting test for galaxy evolution mod- 
els. For example, we show that the corrected J125 — [4.5] 
color d istribution is almost perfectly reproduced by the 
iGuo et al. (2011.) SAM models, indicating that this SAM 



is reliably predicting galaxy SFHs. However, these same 
models fail in accurately capturing the observed steep- 
ening in the color magnitude relation and in predicting 
the UV LF and its evolution. Both these effects are 
likely connected to the uncertain treat ment of dust ex- 
tinction in such models (s ee e.g. iGonzalez-Perez et al.l 
l20ll:IHenriaues et al.l[2012h . 

5. SUMMARY 

In this paper, we combined ultra-deep HST and Spitzer 
imaging to study rest-frame UV-to-optical colors of 
galaxies at z ^ 4. In particular, we perform a so- 
phisticated neighbor subtraction in the IRAC images to 
obtain a sample of 1273 B-dropout galaxies with clean 
IRAC [4.5] detections more significant than 3(T from two 
fields of the HUDF09 survey as well as from GOODS- 
North and South. The [4.5] channel data is preferred 
over the shorter wavelength [3.6] photometry in order to 
avoid any flux contamination by str ong Ha emission (e.g. 
IShim et al.l[20Tlt IStark et al.l[2012l) . 

Using this sample we find: 

• The observed J125 — [4.5] color is strongly depen- 
dent on rest-frame optical luminosity. In particu- 
lar, the relation steepens significantly for galaxies 
with Mz < —21.5 compared to the lower luminosity 
trend (see Figured]), indicating that the most mas- 
sive galaxies produce and retain more dust relative 
to their stellar mass than lower mass galaxies. 

• A relatively tight correlation is found between 
J125 — [4.5] and the UV continuum slope P (see Fig- 
ure [3]). The most simple explanation for this slope 
is that the UV-to-optical SEDs of z 4 galaxies are 
better reproduced by an SMC-like dust extinction 
rather than the standard Calzetti dust curve. This 
result will have to be confirmed in the future by 
combining large samples of galaxies with spectro- 
scopic redshifts with additional longer-wavelength 
data to constrain the amount of dust extinction. 

• The dust-corrected colors of galaxies are only 
weakly dependent on luminosity, and their distri- 
bution is well reproduced by a Gaussian in log(age) 
of galaxies, with a mean log(age)= 8.57 or 9.00, 
depending on the assumed star-formation history 
(either constant or rising with time). 

• Only a very small fraction of 12 — 19% of galax- 
ies shows colors that are consistent with extremely 
young galaxy ages, below 100 Myr. The exact num- 
ber of this fraction is dependent on the assump- 
tions about the star-formation histories of galaxies. 
However, the vast majority of galaxies is consistent 
with a smooth build-up and only a negligible frac- 
tion of their stars being formed in very recent burst. 

This paper gives a brief preview of the power of com- 
bining HST and Spitzer observations for studying galax- 
ies at z > 4. After completion of the still on-going S- 
CANDELS program, these types of analyses can be ex- 
tended to cover more than twice the area of our current 
sample to a depth of 50 h (about 0.4 mag deeper than 
the GOODS IRAC data used here). These data sets 
will allow us to constrain SFH and age distributions of 
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large samples ol galaxies at z > 3. In particular when 
combined with additional constraints on dust extinction, 
e.g. from Herschel or ALMA, and with spectroscopic 
redshifts from the new and upcoming multi-object NIR 
spectrographs. This should allow for the next big steps 
forward over the next few years in working toward a self- 
consistent picture of star-formation and mass build-up 
before the peak of cosmic star-formation. 
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APPENDIX 

A. THE EFFECTS OF USING THE MEURER ET AL. 1999 DUST CORRECTION 

As we show in the main text of this paper, an SMC like dust curve appears to be more consistent with the J125 — [4.5] 
vs /3 relation (see Figure [3]). This motivated our use of an SMC slope to correct galaxy colors for dust. Howe ver, for 
consis tency with much of the previous literature, we also show the effe cts of using the comb ination of the Meur er et al.l 
(|1999f ) relation between Auv and the UV slope /3 together with a ICalzetti et al.l (|2OOO0 dust curve. The Meurer 
corrected color histograms are shown in Figure |9l 

As can be seen, this approach results in dust corrected colors that are bluest at the brightest luminosities. Further- 
more, the corrected colors for the bluest galaxies with /? < —2 are significantly redder than for the full population. 
If this were true, this would indicate that the bluest galaxies are the oldest in the sample, which is not expected. 
Therefore, the combination of Meurer and Calzetti dust likely over-corrects the J125 — [4.5] colors. 

Despite of the larger correction, the integrated fraction of galaxies with corrected colors bluer than what is expected 
for a CSF population at 100 Myr is still only 19%. 



lUDFlO: Rest-frame Optical View on z ~ 4 Galaxies 



11 




Fig. 9. — Same as Figure[5l but showing the dust-corrected colors when using the lMeurer et aLl II1999I ) Ajjy — P relation and the Calzetti 
dust curve. For comparison, the dashed line shows the histogram for our fiducial dust correction as described in the main text. As can 
be seen, using the Calzetti dust correction results in dust corrected colors that are bluest at the brightest luminosities. Furthermore, the 
corrected colors for the bluest galaxies with f) < —2 are significantly redder than for the full population. If this were true, this would 
indicate that the bluest galaxies are the oldest in the sample, which is not expected. Therefore, the combination of Meurer and Calzetti 
dust likely over-corrects the J125 — [4.5] colors. Nevertheless, the integrated fraction of galaxies with corrected colors bluer than what is 
expected for a CSF population at 100 Myr is only 19%. 




UV continuum slope p 



Fig. 10. — Schematic figure showing our fiducial dust correction procedure. The solid line shows the track of a lBruzual &: Charlotl l|2003 ) 
SED model with constant star-formation at 0.2Zq. For every galaxy we start from the observed location in the /J vs J125 — [4.5] plot (red 
dot), and we find the intersection of the negativ e reddening vector with the SED track (filled bl ue dot) to determine the dust-corrected 
J125 — [4.5] color. For comparison, the standard IMeurer et al.i ((i999|) correction together with a lCalzetti et al.l H2000I ) dust curve is also 
shown by the dark gray vector, which would result in the open blue dot. In particular for very UV-red galaxies the difference between 
the two dust corrections can be significant. The M99 relation assumes a dust-free UV slope of ^ = —2.23 (vertical dashed line), which is 
substantially redder than the values measured for star-forming SEDs with ages < IGyr. For the few galaxies that lie to the left of the SED 
track, we do not apply any dust correction to the color. The red dashed line represents the mean relation between /3 and J125 — [4.5] of 
Equation [1] shown in Figure [S] 



